Unzipping DNA - towards the first step of replication 



o 

Q 



CO 



I 

C 

O 

o 



> 

&\ 
(N 
(N 



-a 
c 

o 
o 



X 
S3 



Somendra M. BhattacharjeeQ 
Institute of Physics, Bhubaneswar 751 005, India 

The opening of the Y-fork - the first step of DNA replication - is shown to be a critical phenomenon 
under an external force at one of its ends. From the results of an equivalent derealization in a non- 
hermitian quantum-mechanics problem we show the different scaling behavior of unzipping and 
melting. The resultant long-range critical features within the unzipped part of Y might play a role 
in the highly correlated biochemical functions during replication. 



DNA, the basic genetic material, is generally a very 
long, flexible, linear or circular molecule, with length 
varying from 2/im (5000 base pairs) for simple viruses 
to 3.5 x 10 7 /j,to (~ 10 11 base pairs) for more complex or- 
ganisms. In spite of the wide diversity in the information 
content and the functionalities of organisms, the general 
rules for replication have an astounding universality in 
the sense of independence of the system. A double heli- 
cal DNA can be made to melt (i.e. the two strands can 
be separated) in vitro (i.e. in the lab) by changes in pH, 
solvent conditions and/or temperature ("thermal melt- 
ing"), but they are found to be extremely stable in the 
cell. As per the standard dogma of molecular biology, 
every step of a biochemical process is mediated by an 
enzyme (encoded in the DNA), and the process "that ac- 
companies DNA replication requires enzymes specialized 
for this function: DNA helicases to disrupt the base pairs, 
topoisomerases to provide a swivel to unwind the strands, 
and single-strand binding proteins (SSB's) to coat and 
stabilize the unwound strands. Melting of the duplex at 
a replication origin for initiation, and at the fork during 
elongation, requires an expenditure of energy, an invest- 
ment justified by the functional and evolutionary benefits 
of maintaining the genome as a duplex DNA." [jjj 

Many biochemical details of the replication process are 
known, organism by organism. The proposal of a Y- 
shaped structure ^| for a linear molecule || as the start- 
ing point of replication seems to be corroborated by ex- 
periments Q , but as yet a proper analytical understand- 
ing is lacking. Very recently, with the advent of various 
physical techniques, attempts have been made to study 
the process from a physical, rather than a biochemical, 
point of view. Special attention has been given to the 
measurements of forces to unzip a double stranded DNA 
(dsDNA) molecule. Several studies have been made to 
understand the effect of forces in absence of any enzymes 
while, in other studies, enzymatic activities involv- 
ing expenditure of energy and displacement have been 
interpreted in terms of effective forces M. Similar to 
"transcription against applied force" of Ref. ||, experi- 
ments combining the mechanical opening of DNA with 
the enzymatic replication or transcription have been pro- 
posed j^], and such an experiment would be highly sig- 
nificant. The results of Ref. H distinguish the cases of 
conventional thermal melting and the unzipping which 



has been called directional melting ||. Our analytical 
approach shows the differences between these two cases. 

Some of the quantitative questions are the following: 

(1) Is there a critical force to open up a double-stranded 
chain in thermal equilibrium, acting say at one end only? 

(2) Is the nature of the transition different from the ther- 
mal melting of the bound pair and is it reflected in the 
opened region? Once these equilibrium questions are un- 
derstood, we may ask (3) how the pair opens up in time 
after the force is applied - the question of dynamics of 
unzipping. 
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FIG. 1. (a) Pulling the two strands at one end. (b) A 
"denatured" bubble due to thermal fluctuation ("melting"), 
(c) an unzipped region slightly below the critical threshold 
("directional melting"). In (b) and (c), shaded parts denote 
bound regions. 



In order to focus on the effect of the pulling force (see 
Fig. 1) on the bound strands, we take the viewpoint of 
a minimal model that transcends microscopic details but 
on which further details can be added for a realistic sit- 
uation. Our approach differs from the previous studies 
[]To| — p~2|| in the emphasis on the opening of the fork. We 
treat the DNA as consisting of two flexible interacting 
elastic strings. Winding is ignored. In a sense, the effect 
of topoisomerases is built into the model by maintaining 
the chains in the native state. Such a model has been 
found to be useful for many properties of DNA [|l3] |l7| . 
For quantitative results we consider a bound situation as 
obtained by a binding square-well potential. Our focus is 
on the simpler aspects of the theory, and here we answer 
the first two questions posed above. 
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Our model is this: Two gaussian polymer chains |lq ] 
in d = 3 dimensions Jl9| , with r denoting the position of 
a monomer along the contour of a chain, are tied at one 
end and pulled by a stretching force g = ge g at the other 
end (see Fig. 1), e g being the unit vector in the direction 
of the force. The energy from the force is proportional 
to the separation r(N) of the end points at r = N, i.e. 



r(N) 
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drjr) 
dr ' 



(1) 



since r(0) = 0. Assuming identical interaction for all 
base pairs ( or for a DNA with identical base pairs) the 
Hamiltonian can be written in the relative coordinate BCfl 



H 
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= I dr 



dr(r) 
dr 



dr(r) 
dr 



V(r(r)) 



(2) 



where r(r) is the ci-dimensional relative coordinate of the 
two chains at the contour length r. Base pairings require 
that the monomers on the two strands interact only if 
they are at the same contour length r. The potential 
energy is therefore given by the integral of the poten- 
tial V / (r(r)) over t. V(r) is a short-range potential and 
its detailed form is not important. One choice, as gener- 
ally used for renormalization group approach, is to take a 
contact potential V(r) — voSa^) where, in Fourier space, 
^a(q) = 1 for | q |< A, A being a cut-off reminiscent of 
the underlying microscopic structure. For the equivalent 
quantum mechanical calculation, we choose a square- well 
potential. In thermal equilibrium, the properties are ob- 
tained from the free energy F = —ItBThiZ where the 
partition function Z = J T>lZexp(—H/kBT) sums over 
all the configurations of the chains. 

The Hamiltonian written in the above form can be 
thought of as a directed polymer in d + 1 dimensions 
about which many results are known [j2l|j22[ for g = 0. 
If we treat r as a time like co-ordinate, then the same 
Hamiltonian represents, in the path integral formulation, 
a quantum particle in imaginary time. This quantum 
particle with g ^ then corresponds to the imaginary 
vector potential problem much discussed in recent times 
p3~f . We make use of both the pictures in this paper and 
follow the formulation of Ref. [^3| closely. 

For d = 1, the quantum problem with V(x) = —vq5(x), 
is exactly solvable, and is done in Ref. [^3) as a single im- 
purity problem. It was shown that there is a critical g c 
below which the force does not affect the bound-state en- 
ergy, i.e. the quantum particle remains localized near the 
potential well, while for g > g c the particle delocalizes. 
In the polymer picture, this means that in low dimen- 
sions, a force beyond a critical strength separates the 
two strands. It should be pointed out here that the force 
is applied at one end only, and but it is not a boundary 
or edge effect mainly because of the connectivity of the 
polymer chain as expressed by Eq. |l|. 



Details of the phase transition behavior of the Hamil- 
tonian of Eq. U for 5 = are known from exact renormal- 
ization group (RG) calculations |^l|,^2| for a 5-function 
potential. With g = 0, there are two fixed points (fp) at 
(i)u* = 0, and (ii) u* = e, with e = 2 — d and u = vL c 
as the dimensionless running potential strength param- 
eter, L being an arbitrary length-scale. For d < 2, the 
first one is unstable and hence the critical point while for 
d > 2(e < 0), the second one is the unstable one. The 
unstable fp (u*) represents the melting or unbinding of 
the two chains. It also follows |2^] that other details |24|] 
of V(r) are irrelevant in the RG sense, i.e., they vanish 
in the large length-scale limit, explaining the universality 
of the problem. 

The important length-scales for this critical point, 
from the bound state side, comes from the typical size 
of the denatured bubbles of length r m along the chain 
and £ m in the spatial extent (Fig. lb). Close to the crit- 
ical point these length scales diverge as £ m ~| Au \~ Vm 
and T m ~ where Am is the deviation from the critical 
point, where u m and £ are the two important exponents 

ill, 



u m = l/\d-2\ and £ = 2. 



(3) 



It is this £ (the dynamic exponent of the quantum prob- 
lem) that will distinguish the new phenomena we are try- 
ing to understand P5| . 

The mapping to the imaginary time quantum mechan- 
ics allows us to use the methods of quantum mechanics 
p3[ . As mentioned, we choose a square- well potential 
V(r) = — Vb, for r < ro, and 0, otherwise. The quantum 
Hamiltonian is then given by [|23| 
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in units of % = 1 and mass m — 1, with p as momen- 
tum. For simplicity, the well is chosen to be just deep 
enough to have only one bound state (in the quantum 
mechanical picture, with g = 0) with energy Eq < 0. 
The non-hermitian Hamiltonian can be connected to the 
hermitian Hamiltonian at g — by 

U- 1 H q (g)U = H q (g = 0), where U = exp(g • r). (5) 

The wave-functions are also related by this U- 
transformation so that if the transformed bound (i.e. lo- 
calized) state wave-function remains normalizable, the 
bound state energy will not change. We refer to Ref. |23[ ] 
for details of the argument. The continuum part of the 
spectrum will have the minimum energy E = —g 2 /2 (the 
state with wave- vector k = 0) . For the localized state at 
g = 0, the wave-function for r > r is ipo(r) ~ exp(— nr) 
where n = 2y| Eq |. The right eigenvector for H q (g) 
is then i/jr( t ) ~ ex P(g ' r — Kr )> obtained via the U- 
transformation. This remains normalizable if g < k, so 
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that the binding energy remains the same as the 5 = 
value until g = g c = k. The generic form of the spec- 
trum is shown in Fig. 2a. This indicates a derealization 
transition by tuning g - the unzipping or the directional 
melting of DNA at g = g c . The phase diagram is shown 
schematically in Fig. 2b. There is a gap in the spectrum 
(Fig. 2a) for g < g c and the gap vanishes continuously 
as | g 2 — n 2 |~| g ~ g c | as g — ► g c — . Since the time in 
the quantum version corresponds to the chain length, the 
characteristic chain length for the derealization transi- 
tion is therefore p6j ] 

7dm (5) H g c - 9 T 1 ■ (6) 

The spatial length-scale of the localized state is deter- 
mined by the width of the wave- function and, for g = 0, it 
is set by For g ^ 0, the right wave-function,'!/?/}, has 
a different length scale and this length-scale diverges as 
the wave-function becomes non-normalizable. The width 
of ipR gives this scale as 

£dm(ff) H 9c - 9 \~ VAla with l/ dm = 1) (7) 

for g — » g c — . We find Td m ~ £dm and therefore 

C = 1. (8) 




FIG. 2. (a)The energy spectrum for g 7^ for the Hamil- 
tonian in Eq. ^. (b) The phase diagram in the u — g plane. 
Thermal melting takes place along the g — line at u — u«. 
The hatched line indicates the unzipping transition or direc- 
tional melting line. The biochemical reactions is conjectured 
to occur at a slight subcritical region indicated by the hatched 
region. 

The significance of Td m can be understood if we study 
the separation of the two chains, i.e., (r) T , at a distance 
r along the chain below the pulled end. This can be eval- 
uated by using the standard rules of quantum mechanics 
. For infinitely long chains in the sub-critical region, 
only the bound and the first excited states are sufficient 
for the computation. One finds, along the pulled direc- 
tion, 

(r) T ~ exp[-r/r dl n(ff)], (9) 

where Td m (g) is given by Eq. |6[ In other words, Td m (<?) 
and £dm(<7) describe the unzipped part of the two chains 



near the pulled end (Fig. lc). These length scales diverge 
with £ = 1 as the critical force is reached from below. 
(One can also define similar length-scales for g — > g c + 
where the lengths would describe the bound regions.) 
The exponential fall-off, Eq. |9|, of the separation from 
the pulled end immediately gives the picture of the repli- 
cation Y-fork as shown in Fig. lc. 

Let us study the behavior of the free energy. Since 
the partition function for the Hamiltonian of Eq. |^ 
obeys a diffusion-like equation [ p~8[ , the free energy T = 
F/ksT — g 2 r/2 satisfies the equation 

^ = ^f 2T \^ T ) 2 -S-y^F- v S A (r). (10) 

The left hand side represents the free energy per unit 
length of the chains. Under a scale transformation 
x — > bx, and r — > b^r, the total free energy remains 
invariant so that the above equation takes the form 

= l _ h C-2 V 2 T _ 1 6 C-2 (V ^ )2 _ 

dr 2 2 v ; 

b^g-VT-b^voSAir). (11) 

For g — 0, Eq. [ll] tells us that ( — 2 and d — 2 are 
special for melting transition as we see in Eq. | . For the 
choice C — 1) the g-dependent term dominates and all 
other terms become irrelevant for large length-scale b. It 
is this feature that shows up in the Y-fork of the unzipped 
chain. The robustness of Eqs. also follows from this. 
With the dependence on the potential strength entering 
only through g c , these are valid along the hatched line 
of Fig. 2b, and could be oblivious to the details of the 
nature of the melting transition. In our simple model, 
the melting point appears as a multi-critical point in the 
phase diagram of Fig. 2b. 

Now we see the striking difference between the direc- 
tional melting and thermal melting (Fig. 1). In the ther- 
mal case the bubbles will have anisotropic shape with the 
spatial extent scaling as the square-root of the length 
along the chain. This is the characteristic of the fixed 
point it*, while the pulled case is described by a differ- 
ent scaling. The exponential profile of the Y-fork and 
its scaling are the two important characteristic features 
which should be experimentally verifiable. 

We have considered only the equilibrium situation. 
The dynamics of this process of directional melting or 
unzipping is important. Similarly the condition of iden- 
tical interaction may not be realistic for real DNA. This 
feature can be cured by taking the interaction vq to be 
random p7| with a specific distribution. Such a ran- 
dom case J27| shows a different type of melting behav- 
ior. The mapping to a quantum problem is also then 
lost. Self-avoidance and other topological constraints 
can also be added to this model, though at the cost of 
the simplicity of the model. For example, self-avoidance 
can be introduced in Eq. by adding |lq] a term 
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5 / At J dr'u s (5(r(r) - r(r')), with w s > 0. The general- 
ization to a two-chain problem is straight-forward. Such 
a term can be "unsquared" by introducing ps|] an an- 
nealed gaussian-random potential V[(r) with zero mean 
and variance (Vi(r)Vi(r')) = v s S(r~ r') so that the equiv- 
alent quantum hamiltonian is H q = H q + iV\{v), where 
H q is given by Eq. |J This involves both an imaginary 
vector potential and an imaginary, random, scalar po- 
tential. The polymer problem is recovered from the V\- 
averaged propagator of the quantum particle pi| . Such 
a general non-hermitian hamiltonian is little understood 
at present even for the pure case, let alone the random 
one. We wish to come back to these issues in future. 

Let us summarize our results and the emerging picture. 
There is a critical strength of the force required to un- 
zip a double stranded DNA, and this directional melting 
is a critical phenomenon. Thermal or other fluctuations 
can open up regions along the length of the chain (bub- 
bles with £ = 2) but the unzipped part is characterized 
by a different scaling (£ = 1), as shown in Fig. 1. In 
other words, the Y-fork created by the force represents 
a correlated region which is easily distinguishable from 
a thermal bubble. Living organisms probably work at a 
slightly sub-critical regime with g < g c to take advantage 
of and exploit the distinct correlations for the enzymatic 
actions within the Y-fork. This can be achieved by cou- 
pling the biochemical reactions (like polymerization, un- 
winding etc) to the unzipping phenomenon, analogous, 
in spirit, to reaction-diffusion systems. The near-critical 
features of unzipping or directional melting then can lead 
to a coherent phenomenon we see as replication - a pro- 
cess requiring highly cooperative functioning of different 
enzymes in space and time. This way of viewing the cor- 
related biochemical events has not been studied so far. 
As a first step, we therefore suggest that high precision 
measurements be done to get the profile along the un- 
zipped region of DNA (or simpler double-stranded poly- 
mers) under a pulling force, in vitro or in vivo. 
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